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Abstract 

In this paper, we study distributed estimation and control problems over graphs under partially nested 
information patterns. We show a duality result that is very similar to the classical duality result between 
state estimation and state feedback control with a classical information pattern, under the condition that 
the disturbances entering different systems on the graph are uncorrelated. The distributed estimation 
problem decomposes into N separate estimation problems, where N is the number of interconnected 
subsystems over the graph, and the solution to each subproblem is simply the optimal Kalman filter. This 
also gives the solution to the distributed control problem due to the duality of distributed estimation and 
control under partially nested information pattern. We then consider a weighted distributed estimation 
problem, where we get coupling between the estimators, and separation between the estimators is not 
possible. We propose a solution based on linear quadratic team decision theory, which provides a 
generalized Riccati equation for teams. We show that the weighted estimation problem is the dual to 
the distributed state feedback problem, where the disturbances entering the interconnected systems are 
correlated. 
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I. Introduction 

A. Background 

Control with information structures imposed on the decision maker(s) have been very challeng- 
ing for decision theory researchers. Even in the simple linear quadratic static decision problem, 
it has been shown that complex nonlinear decisions could outperform any given linear decision 
(see |[T6lO . Important progress was made for the stochastic static team decision problems in [|9) 
and irTTI . New information structures were explored in []7] for the stochastic linear quadratic 
finite horizon control problem. Similar algebraic conditions where given in [0 for homogeneous 
systems. In [12], the stationary state feedback stochastic linear quadratic control problem was 
considered using state space formulation, under the condition that all the subsystems have a 
common past, with the difficulty of recovering the structure of the distributed controller. With 
common past, we mean that all subsystems have information about the global state from some 
time step in the past. The time- varying and stationary output feedback version was solved in [6]. 
Recently, nice studies of Partial Nestedness in linear quadratic dynamic team problems appeared 
in Yuksel ifTTl and Mahajan et al [[8]|. The n-step delay problem is studied in |[T0l . Duality between 
estimation and control for distributed control problems of heterogeneous systems under arbitrary 
sparsity and delay partially nested structure, was explored 0, where state-feedback control and 
estimation was shown to be solved by a set of independent Riccati equations. In particular, A3 
showed that optimal controllers have a finite order for any partially nested information strucutre. 
A state-space solution for iV systems with no delays was given in [[141 with a different approach 
relying on partially ordered set formulation. The work in [?] considers realizable solutions in 
the presence of noise. 

B. Contribution 

In this paper, we will show a duality result between distributed state estimation and distributed 
state feedback control under partially nested information including delays, similar to the cen- 
tralized estimation and state feedback problems. Since the distributed control and estimation 
problems are dual, we show how to find the optimal distributed estimator (and hence the 
optimal distributed state-feedback controller). The distributed estimation problem decomposes 
into iV separate estimation problems, where N is the number of interconnected subsystems of 
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the network. We give an explicit solution for the three interconnected systems' case under 
two different graphs. The paper is an extension of [13 where we consider a more general 
framework. The general framework includes a weighted distributed estimation problem, where we 
get coupling between the estimators, and separation between the estimators is not possible. We 
propose a solution based on linear quadratic team decision theory, which provides a generalized 
Riccati equation for teams. We show that the weighted estimation problem is the dual to the 
distributed state feedback problem, where the disturbances entering the interconnected systems 
are correlated. The solutions do not assume stable systems, and a stabilizing solution is obtained 
automatically when it exists. 

C. Notation 

Let K be the set of real numbers, Z 2 = {0, 1}, S>™ + is the set of n x n positive definite matrices. 
x ~ J\f(m,X) means that x is a Gaussian variable with E{x} = m and E{(x — m)(x — m) T } = 
X. [M]i, denotes the block row or column i of a matrix M depending on the context. For a 
matrix A partitioned into blocks, [A]^ denotes the block matrix of A in block position 
I n is the n x n identity matrix. For vectors i> fc ,i> fc _i, ...,v , we define V[ 0)k ] := {vk,Vk-i, ...,Vo}. 
We denote a discrete-time (stochastic) process x(0), x(l), x(2), ... by {x(t)}. The forward shift 
operator is denoted by q, that is x(t + 1) = qx(t). A causal linear time-invariant operator H(q) 
maps a process {x(t)} to an output y(t), where y(t) = H(q~ 1 )x(t), and i/(q _1 ) is given by 
its generating function ([15]), //(q _1 ) = Em^^"*' h (t) e ^ mxn . The norm of ((//(q^ 1 )!! 
is defined as (^(q^ 1 ) || 2 = E||iJ(q- 1 ) W (t) || 2 = \\h(t)\\ 2 = ^ t = Tr[h T (t)h(t)], where 

{w(t)} is a sequence of uncorrelated Gaussian variables with w(t) ~ A/"(0, 1). A transfer matrix 
in terms of state-space data is denoted 

= C(ql — A)~ l B + D. 

II. Linear Quadratic Team Theory 

Define a probability space (fi, J 7 , V). Let yi be pi -dimensional random variables, for % = 
1,...,N, and set p = p\ + • • -pn- Let T{ be the sigma field generated by y^ Introduce, H, 
the space of all nN x N matrices whose elements are measurable functions from £1 to R. Let 



A 


B 


C 

V 


D 



^6§f + , and define 



< H x , H 2 >= Tr E{iW# 2 T } C 1 ) 
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for Hi, H 2 E H. Then, H is a Hilbert space with inner product ® and norm =< H, H >. 

Let V C H such that for D E V, the ith column of D, D^ is Ti -measurable. 

The columns of D, D 1 , D N , make up a team, where the players Di make decisions in local 
information given by yi, to minimize a cost of the form 

< D - $X D - <$>X > 



for some $ £ 



vnNxnN 



and X EH. 



Proposition 1: Let X E H. The minimum of < D — X, D — X > for D E V is acheived by 
the unique X E V satisfying 

< X -X,D >= 

for all DEV. 

Proof: Consult 0. ■ 
The following proposition gives a certainty equivalence property for team problems: 
Proposition 2: Let 

X = are min < D — X,D - X > . 



Then, 



Dev 



$X = arg min < D - <3>X D - &X > . 

De£> 



Proof: Consult 0. ■ 
Proposition 3: Let Uj be n-dimensional vectors and Lj £ R nx ™, for z = 1,...,N, and $ £ 

DnJVxniV with 

Li • ■ ■ Ln 




Let 



and 



it* = arg min E{(w — Lx) T W(u — Lx)} 



X = arg min < D — X, D - X > 
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Then, 

N 

U * = /Z = 

Proof: Consult 0. 



Definition 1: X is W^-orthogonal to Y if < X, Y >= 0. 

Definition 2: For a sequence {Z k } Zk ev, {Z k } is called Vl^-white noise if < Z k , Zi >= for 
all k^l. 

Note that for W = I, we get the formal definition of white noise in the classical sense. 
Now introduce the matrix 

Y — diag(t/i, • - • ,y N ). 

The next proposition shows how to obtain the linear optimal solution D = KY: 
Proposition 4: Let X E U. The minimum of \\KY - X\\ w over K E R nNx P with KY E V 
is acheived by the unique K* given by 

K* = E{XWY T }(E{YWY T })- 1 . 

Proof: Consult 0. ■ 

III. Systems over Graphs 
Consider linear systems Pj(q _1 ) with state space realization 

N 

Xi(t + 1) = V Ay-x,-(t) + B iUi (t) + Wi(t) 

3=1 (2) 

= CiXi(t) + Vi(t), 

for z = 1, N. Here, E W 1 ^, B { E R n > xm >, and d E W> xn '. w { is the disturbance and u t 
is the control signal, entering system i. Also, we have that J2i m i = m > 2~2i n « = n ' J2iPi = V- 
The systems are interconnected as follows. If the state of system j at time step t (i.e., Xj(t)) 
affects the state of system i at time step t + 1 (i.e., + 1)), then A+j ^ 0, otherwise A+j = 0. 
This block structure can be described by a graphQ Q of order N, whose adjacency matrix is A. 
The graph Q has an arrow from node j to i if and only if Aij ^ 0. The transfer function of 

'See the Appendix for a short introduction to graph theory. 
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the interconnected systems is given by P(q _1 ) = C(q/ — A)^ 1 B. Then, the system P T (q _1 ) 
is equal to B T (c\I — A T )~ 1 C T , and it can be represented by a graph Q* which is the adjoint 
of Q, since the adjacency matrix of Q* is A* = A T . The block diagram for the transposed 
interconnection is simply obtained by reversing the orientation of the interconnection arrows. 
This property was observed in flU. 

For any generating function F(X), we write the generating function 



Definition 3 (Sparsity Structure): Let m,n,N be integers with m, n > N, A G Z 2 x , and 



We say that G(X) has the sparsity structure given by A if G(X) G S^,. 

Theorem 1: Suppose that Gi(X) G S^ xp , G 2 (X) G S p J^ n for a given adjacency matrix A G 
Z% xN . Then G 1 {X)G 2 {X) G S™ xn . 

Proof: Let G 1 (X) = E t >o #i (*) ^ and G 2(A) = Et>o 92(t)X t . Then, G 3 (A) = G 1 (A)G 2 (A) = 
Etx^W^*' wnere <73(X) = Es=o#i( s )#2(£ — s). Let r denote the i:th row of A s and c 
denote the j:th column of A l ~ s . Then A\j = [A s ■ *4*~ s ]r,- = r ■ c. Now = 0, implies 
that r ■ c = 0. Since r and c consist of non-negative integers, we have either r k = or 
Cfc = 0, for all k. In an analog manner, let u denote the i:th block row of gi(s) and v 
the j:th block column of g 2 (t — s). Clearly, Vj = [A s ]ij = implies that Uj = 0, and 
Ci = [A^ t ~ s ' ) ]ij = implies that = 0. Thus, for all k, either u k or v k is zero, that is u k Vk = 0. 
Hence, [gi(s)g 2 (t — s)]y = it • t> = wi^i + • • ■ + unVn = 0, and so [gz{t)]ij = 0. We conclude 



that [A% 3 = [<? 3 (t)]^ = 0, and so G 3 (A) = E*>o^(*)A* G S™ xn - 

■ 

Theorem 2: Let .4 be a given adjacency matrix, H X {X) G S^ xn , and H 2 {X) G S^ xn . Then 
/J 2 (A)(/-^ 1 (A))- 1 G5X xri . 

Proof: Let # 3 (A) = # 2 (A)(J - ^i(A))" 1 G S^*™- The formal power series of H 3 (X) is 



G(A) = (/-F(A))- 1 = ^(F(A)) t . 



<mxn 




H 3 (x) = J2H2(XKH 1 (X))y. 



March 3, 2013 



DRAFT 



7 



Recursive use of Theorem [J] implies that # 2 (A)(#i(A))) s G S% xn for all s > 0. Hence, # 3 (A) G 
S% xn , and the proof is complete. 

■ 

Remark. Theorem \T\ gives a more general invariance property than Quadratic Invariance lfl"3l 
in our case. We show that the structure of Hi(X) of H 2 (X) is preserved under multiplication, 
and Theorem |2] shows that H t (X) under negative feedback of H 2 (X), the structure of the closed 
loop H 2 (X)(I — Hi(X))^ 1 is preserved. For Quadratic Invariance, K(X)(I — G(X)K(X))~ 1 has 
the same structure as G(X) and K(X) if and only if K(X)G(X)K(X) has the same structure 
as K(X) and G(X). In our case, taking H 2 (X) = K(X) and H X (X) = G(X)K(X), then if G(A) 
and if(A) have the same structure, then so does G(X)K(X) = -Hi (A). It implies that both 
H 2 (X)H 1 (X) = K(X)G(X)K(X) and H 2 (X)(I - H^X))- 1 have the the same structure as G(X) 
and K(X). 

IV. Duality of Estimation and Control 
A. Distributed State Feedback Control 
Consider the interconnected systems 

N 

Xj(t + 1) = AijXj{t) + BnUi(t) + Wi(t) 

3=1 (3) 

Ui(t) = Xi(t), 

w(t) ~ jV(0, 7) for all t, and = for all t < 0. Without loss of generality, we assume that 
.Bjj has full column rank, for i = 1, N (and hence has a left inverse). 

The problem we are considering here is to find the optimal distributed state feedback control 

oo 

mit) = Kiiq-^xit) = ki(s)x(t - s), (4) 

s=0 

for i = 1, N that minimizes the quadratic cost 

1 - 

J(x, u) := lim — n\Cx(t) + Du(t) f, 

The partially nested information pattern is reflected in the parameters where fci 3 -(s) = 

if [^4 s ]ij = 0, and A G Z^ xjv is the adjacency matrix of the interconnection graph . Thus, the 
block sparsity structure of K(q^ 1 ) is the same as the sparsity structure of 

(/ - q-U)- 1 = / + ^q- 1 + A 2 q- 2 + ■■■ , 
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and so K(X) E S? xn . To summarize, the problem we are considering is: 

1 M N 
inf lim — V V E||^(t)|| 2 

subject to x(t + 1) = Ac(i) + Bu(t) + 
z (t) = Cx(t) + Du(t) 

B = &ag(B u ,...,B NN ) (5) 

oo 

u(t) = k(s)x(t — s) 

w (t) = x(t) = x(0) = for all t < 
w(t) ~ W(0, J) for all t > 

5. Distributed Feedforward Control 

The feedforward control problem is closely related to the state-feedback problem: 

M N 

inf lim _y^y^E||^(i)|| 2 

v ' -4 t=l i=l 

subject to x(t + 1) = Ax{t) + Bu(t) + 
B = diag(5ii, ...,B NN ) 

z(t) = Cx(t) + Du(t) (6) 

oo 

u(t) = -J29(s)w(t-l-s) 

s=0 

w(t) = x(t) = x(0) = for all t < 
~ jV(0, J) for all t > 

Note that © and © are not equivalent in general, since the latter only uses information about 
that external signals, w, entering the system, whereas for more restrictive informations structures, 
the control signals could carry information (see [16] and 0). 
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C. Distributed State Estimation 
Consider N systems given by 

x(t + l) = Ax(t) + Bw(t) 

(7) 

Vi(t) = CuXi(t) + Diw(t), 
for i = 1, ...,N, w(t) ~ J\f(0,I), and x(t) = for all t < 0. Without loss of generality, we 
assume that Cj has full row rank, for 2 = 1, N. The problem is to find optimal distributed 
estimators Xi(t) to minimize the cost 

^ M N 

J^mEEeh^)-^)!! 2 (8) 

t=l j=l 

In a similar way to the distributed state feedback problem, the information pattern is the partially 
nested, which is reflected by the interconnection graph, so L(X) E S^ xm . The linear decisions 
are optimal, hence we can assume that 



Xi(t) = Hq-^yit - 1) = l *( s )y(t " 1 " s). (9) 
Then, our problem becomes 



M N 



inf lim — V VEll^ft) - Xi(t)\\ 2 
subject to x(t + 1) = Ax(t) + Bw(t) 

y(t) = Cx(t) + Dw(t) 

C = di ag (C u ,...,C NN ) (10) 

oo 

x(t) = Y,l(s)y(t-l-s) 

s=0 

w (t) = x(t) = x(0) = for all t < 

w(t) ~ Af(0, 1) for allt > 
In the next section, we will show the connection between the three problems that were 
introduced in this section. 
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V. Duality Results 

A. Duality of State-Feedback and Feedforward Control 

Theorem 3: The problems © and © are in bijection. 
Proof: First write 

x(t) = (qi - A- BKicf 1 ))- 1 ™^) 

(11) 

= (I - Aq" 1 - BK(qr 1 )qT 1 y 1 qT 1 w(t). 

Then 

u (t) = -if (q- x )(i - AqT 1 - BKiqr^qr 1 )-^ - 1). 

Set Hi (A) = AA + BK(X)X and i? 2 (A) = —if (A). Since SA G S^ xm , Theorem Q] implies that 
BXK(X) G S% xn , and thus iix(A) G 5^ xn . Now applying Theorem E we get 

G(A) = -if (A)(i -AX- BK(X)X)- 1 G S% xn . 

In a similar way, we find that 

if (q- 1 ) = GCq- 1 )^ - Aq- 1 - £if (q-^q" 1 ) 

if (q- 1 ) + C7(q- 1 ) J B^(q- 1 )q- 1 = G?(q~ l )(J - Aq 1 ) 

t 

if (q" 1 ) = (i + C7(q- 1 ) J Bq- 1 )- 1 G(q- 1 )(J - Aq" 1 ) 

(12) 

= G^XI + Bq-^Cq" 1 ))-^/ - Aq" 1 ). 

Applying theorems [H and |2] to the generating function above shows that G(A) G S% Xn =>• 
if (A) G S™ xn . Hence, there is a bijection between the two controllers if and G, and the proof 
is complete. ■ 
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B. Duality of Distributed Estimation and Feedforward Control 

Theorem 4: Consider the distributed feedforward linear quadratic problem ©, with state space 
realization 



A 


/ 


B 


C 





D 





-I 






and solution u{t) = ~Yl^o9( s ) w (t ~ s )' X^o#(^* e S^ xn , and the distributed estimation 
problem (flOl) with state space realization 



A T 


C T 





I 





-I 


B T 


D T 






and solution x(t) = £~ l (s)y(t -3-1), E t =o ^) A * e S ^ m - Then ' for a11 s > = /T ( S )- 
Proof: Introduce an uncorrected Gaussian process w(t) ~ jV(0, J) with proper dimensions. 
For any transfer function F, we have that E||F(q -1 )it>(t)|| 2 = ||F(q _1 )|| 2 . Using this fact we 
see that each term in the quadratic cost of © can be written as 

E\\Cx{t) + Du(t)\\ 2 = E \\C{qI - A) -1 io(t) - [C(q/ - A)~ 1 B + D]G{q- 1 )q' 1 w{t)\\ 2 

= ||C(q/ - A)" 1 - [C(qJ - A)- 1 ^ + D]G(q- 1 )q- 1 || 2 

= ||(q/ - A 7 )- 1 ^ - G T (q- 1 )q- 1 [ J B T (q/ - A T y l C T + D T ] f 

= E ||(q/ - A T Y l C T w{t) - G T (q- 1 )q- 1 [ J B T (q/ - A T )- 1 C T + D T }w(t) 

(13) 

where the third equality is obtained from transposing which doesn't change the value of the 
norm. Introduce the state space equation 

x(t + 1) = A T x(t) + C T w(t) 

(14) 

y(t) = B T x(t) + D T w(t) 

and let 

x(t) = G T (q- 1 )y(t-l). 
Then comparing with (fl"3T ). we see that 

N 

E\\Cx(t) + Du(t)\\ 2 = E||x(t) - x(t)\\ 2 = ^E||xi(t) - ^(t)|| 2 . (15) 
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The solution of the control problem described as a feedforward problem, G(q _1 ) G S^ xn , is 
equal to L T (q _1 ) G S^ m , where L(q _1 ) is the solution of the corresponding dual estimation 
problem. ■ 
We have transformed the feedforward control problem to an estimation problem, where the 
parameters of the estimation problem are the transposed parameters of the control problem: 

A -H- A T 

B^C T 

(16) 

C ^ B T 
D -H- D T 

Note that we can have a distributed estimation problem with controller of the form u(t) = 
KiqT^yit-l) with K(X) g & 



pxm. 



A ■ 

- M N 

inf lim — > > Ellxj(i) — 

subject to x(t + 1) = Ac(t) + Biiw(t) + J B 2 M(f) 
= Car(t) + Dw(t) 
S 2 = diag(Bn,...,B JV jv) 

(17) 

C = diag(C u , ...,C N n) 



it) = J2Ks)y(t-i-s) 



s=0 

w (t) = x(t) = x(0) = for all t < 

w(t) ~ jV(0,7) for all t > 
By considering the controller u(t) as a propagating mean, the problem (fTTT ) is essentially the 
same as (TTOl) (compare with the centralized Kalman Filter). 

VI. The Optimal Controller and Estimator 

Since the distributed control and estimation problems are dual, we will show how to find the 
optimal distributed estimator (and hence the optimal distributed state-feedback controller by just 
transposing the optimal distributed estimator). In particular, we will present two examples of 
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three interconnected systems with both sparsity and delayed measurements. First we consider 
an acyclic graph and then a connected graph. Connected graphs possess a property of common 
information that is absent in acyclic graphs. Naturally, any graph can be written as clusters of 
connected graphs, interconnected over an acyclic graph, and these can be put together using our 
framework. 

A. Optimal Distributed Estimators 

Consider the estimation problem given by (fTOT ) (problem (fTTt can be treated similarly). It can 
be decomposed into N decoupled and centralized estimation problems according to 



inf 

h(s),s>0 M-hx) M 



1 M 

lim — > E\\xi(t) - 



t=i 



subject to x(t + 1) = Ax(t) + Bw(t) 
y(t) = Cx{t) + Dw(t) 
C = diag(C u , ...,C NN ) 

oo 

i (t) = ^T / l i (s)y(t-l-s) 



(18) 



.1 i 



0,8 > 

for all t < 



s=0 

li3(s) = Q if[A% 

w(t) = x(t) = x(0) 

w(t) ~ W(0, 1) for all t > 
for % = 1,...,N. By introducing the augmented vector of delayed measurements Y(t — 1) = 
(y(t — 1), y(t — 2), y(t — A/")), the optimal solution is the optimal Kalman filter with respect 
to a subset of blocks of the augmented vector Y(t — 1), which is defined by the structure of 

k(s),s>0. 

We will illustrate how to obtain a state-space solution to the optimal distributed filtering 
problem for the case of three interconnected systems over two different graphs. By duality 
(Theorems |3] and HJ, it is equivalent to finding the state-space solution for the distributed optimal 
control problem. The interconnection is defined by the system matrix 

A u A 12 A 13 
A = A 2 i A 2 2 A 2 z 

Azi ^32 ^33 
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First consider three interconnected systems over a chain, given by the state-space realization 



x^t + l) 




An 


A 12 







Xl(t) 




B l 




x 2 (t + l) 


= 





A 22 


A23 




x 2 (t) 


+ 


B 2 


w(t) 


x 3 {t + l) 










A 33 




x 3 (t) 




B 3 






yi(t) 




C n 










Xi(t) 




D 1 






V2{t) 







C22 







x 2 (t) 


+ 


D 2 


w(t) 




_ys(t)_ 










C33 




x 3 (t) 




D 3 





(19) 



The adjacency matrix of the interconnection is graph is given by A, and we have 



1 










1 


1 







1 


2 


1 




* 


* * 





1 





, A = 





1 


1 


, A> = 





1 


2 


, A s = 





* * 








1 










1 










1 







* 



where the stars stand for positive integers. The condition [A%j = =>- Uj(s) = implies 
that the information available to estimate Xi(t) is given by yi(t — 1 — s) for all s > (since 
[A s ]u = l,Vs > 0), y 2 (t -1-s) for all s > 1 (since [A% 2 = l,Vs > 1), and y 3 (t - 1 - s) 
for all s > 2 (since [A s ]i 3 — l,Vs > 2). The problem of estimating xi(t) based on information 
induced by the sparisity structure of ^4 can be written as a centralized estimation problem, by 
an algebraic lifting, with respect to the extended system dynamics 



Xi(t) 




A n 


A 12 

























Xi{t - 


i) 




B 1 


x 2 (t) 







A 22 


A 23 






















x 2 (t - 


i) 




B 2 


x 3 (t) 










A 33 






















x 3 (t - 


i) 




B 3 


yi(t-i) 




Cn 




























yi(t- 


2) 




D l 


y 2 (*-i) 







c 22 

























V2(t- 


2) 


+ 


D 2 


ys(t - 1) 










C33 






















ys(t - 


2) 




D 3 


yi(t-2) 













/ 



















yi(t- 


3) 























/ 
















V2(t- 


3) 







ys(t-2) 



















I 













ys(t- 


3) 
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Xi(t - 1) 








Xo(t - 1) 









x*(t - 1) 




o 




Vi(t - 2) 









v?(t - 2) 


+ 







v*(t - 2) 









yi(t-s) 









j/ 2 (t-3) 

















Cii 00000000 

00000000 

00000000 

00700000 

yl(t-l)= 000/0000 y 2 {t-2) + w(t - 1) (21) 

00000000 

00000/00 

000000/0 

0000000/ 

The optimal estimate of xi(t) based on the output y\(t — 1) can be obtained from the optimal 
estimate of x e (t) based on the output y\(t — 1). The computation of the optimal (Kalman) filters 
is routine and hence omitted here (consult e. g. [[T)). 

In a similar way, one can find the optimal estimates of x%{t) and x${t) based on the corre- 
sponding outputs y1(t — 1) and y^(t — 1). The information available to estimate X2(t) will be 
y 2 (t — 1 — s) for all s > (since [A s ]22 — l,Vs > 0), and y s (t — 1 — s) for all s > 1 (since 
[^ s ]23 = 1, Vs > 1). [-4 s ]2i = 0, Vs > 0, and hence, no measurements of y\ are available. Finally, 
the estimate x 3 (t) will be only based on y 3 (t — s — 1), s > 0, since [A s ]3j = for j — 1,2, and 
s > 0. 

Now modify the system matrix ^4 by letting the lower left block matrix A 31 ^ 0. This implies 
that we have a cycle of three interconnected systems. Now we get 



1 










1 


1 







1 


2 


1 




* * * 





1 





, A = 





1 


1 


, A 2 = 


1 


1 


2 


, A s = 


* * * 








1 




1 





1 




2 


1 


1 




* * * 



Vs > 2. 



Just as before, the stars stand for positive integers. Note that the information structure is 
symmetric (the interconnection graph is symmetric). Compare with the information structure 
over a chain. This is a fundamental difference between cyclic and acyclic graphs. For the cyclic 
ones, there is a common past (which is 3-steps delayed measurements in the three systems 
case above), whereas for the acyclic ones, this property is lacking. The property of common 
past has been used in [fT2ll . Nevertheless, the solution structure is the same using our approach, 
independent of the graphs being cyclic or not. 
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In general, we can write the dynamical system in (1181 ) as an extended system 



x e (t + 1) = A e x e (t) + B e w{t) 
y e {t) = Ex e {t) + D e w(t) 



(22) 
(23) 



where system i measures block component y l e (t). The optimal Kalman filter Lj(q x ) in the 
stationary case is given by 



x e (t + 1) = A e x e (t) + KivKt) + B e w{t) 
x e {t) = x e (t) - x e (t) 
yl(t) = Eix e (t) + [D e ]iw(t) 



(24) 
(25) 
(26) 



B. Discussion on the Optimal Distributed Controller Structure 

The optimal filter L(A) G S^ xm can be written in terms of its rows 

Li 



where is the optimal estimator of the state £«(£). Li has the state space realization: 



A e - KiEi 










with 







0/0 







(27) 



where the identity matrix I in Tj is in block position i, and K; L is the optimal Kalman gain. For 
instance, comparing with the problem of estimating Xi(t) subject to the extended system (|20l . 
we have 
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and 



A P 



For G = L T , we get 



£1 



A 


11 


A 12 




























A 22 


A23 


























A33 




















c 


11 































C22 































C33 































/ 































/ 































/ 













~C n 


































































































/ 

































/ 



































































/ 

































/ 

































/ 





G 



I 



tT jl 



Now let 



Wi 

w 2 



w 



W N 
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Then 



u(t) = -Gfa-^wit- 1) 



N 



i=i 



We can see that the controller can be written as the sum of N controllers, u(t) = J2iLi u i(t)> 
with Ui(t) = —FT(q~ 1 )wi(t — 1) as the the feedback law with respect to the disturbance u>j 
entering system i. Taking the transpose of (1271 ) gives the state space realization of Ff: 



(28) 













Let 

:= Zi(t+1) = A T eZi (t) + Ejui(t) + TiWi(t). 

It is easy to verify that iii(t) = —Kfzi(t) and Ui(t) = — F i T (q _1 )ty i (t — 1) are equivalent. Hence, 
the optimal distributed controller Ui(t) = — F^q^^w^t — 1) is equivalent to the state feedback 
controller, with respect to the mode generated by u>i(t), Wi(t — 1), for i = 1, 2, N. 



VII. Generalized Distributed Estimation 
Let W G and consider the weighted distributed estimation problem 



1 M 

inf lim —y"E(x(t)-x(t)) T W(x(t)-x(t)) 



L(A)eS^ xm Af^oo M 

subject to x(t + 1) = Ax(i) + Siw(t) 
y(t) = Cx(t) + Zto;(t) 

C7 = diag(C7 11 ,...,C7 JVi v) (29) 

oo 

x(t) = 53/(a)j/(t-l- S ) 

s=0 

«;(*) = x(t) = x(0) = for all t < 

w(t) ~ A/"(0, I) for all t > 
Note that the case W = I reduces to (flOi The matrix introduces coupling between the 
estimators, so the problem can't be solved through separation as in (fTQb ■ This problem has been 
solved for the continuous time case in [3j. We will give the discrete time analogue following 
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the same proof technique as in 0. It can be seen as an abstraction of the Kalman filter, where 
the projection theorem of linear algebra is used sequentially. 

First, write the dynamical system in (|29l ) on the form (|22l )-(l23l and introduce the extended 
linear dynamical system 



X(k + 1) = AX(k) + BW(k) 
Y(k) = CX(k) + VW(k) 



where 



X{k) = diag(a; e (fc), 


...,X e (k)) 


(30) 


Y{k) = diagiyXk), 




(3D 


W(Jfc) = diag(w(fc), 




(32) 


A = diag(A e , ... 


,A e ) 


(33) 


8 = diag(£ e ,... 


,B e ) 


(34) 


C = diag(i?i, ... 


, En) 


(35) 


D = diag([D e ]i, 




(36) 



Then, since W(k) is white noise, it follows that it is W^-white noise. According to Proposition 
[3] in Section HH we can equivalently consider the cost 

\\X(t)-S(Jt)\\w 

instead of 

E(x(t) - x(t)) T W(x(t) - x(t)) 

in the optimization problem (|29l ), where S(t) is a causal linear operator with column Si(t) 
depending only on the output measurements of controller i up to time t — 1, which we will call 
y^ 1 . Let S* be the space of all causal linear operators S(t) such that S{(t) depends only yl" 1 . 
Define X(t) and X(t) as 
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X(t) = axg mil t \\X(t)-S(t)\\ 2 w , 
Y(t)=CX(t), 

and 

X(t) = X(t)-X(t) 
Y(t) = Y(t) - Y(t) 

= CX(t)+VW(t). 

We have that X(0) = and X(0) = X(0). W{t) is orthogonal to the state history X(t),X(t - 
1), so it's VF-orthogonal to S*. Proposition [T] gives that X(t) is W^-orthogonal to S*, so Y(t) 
is VF-orthogonal to S*. In addition, it follows that Y(t) is W^-white noise. Now introduce 

X(t+1) = AX(t) + S(k). 

Then, 

\\X(t + 1) - X(t + 1)\\ 2 W = \\AX{t) + BW{t) - X(t + 1)\\ 2 W 

= \\AX(t)-X(t + l)\\ 2 w + \\BW(t)\\ 2 w 

= \\AX(t) + AX(t) - X(t + l)\\ 2 w + \\BW(t)\\ 2 w 

= \\AX(t)-S(t)\\ 2 w + \\Bm)\\w 
Now combining propositions |3] and @] we get 

K(t)Y(t) = arg min \\AX(t) - S(t)\\ 2 w , 
S(t)ex>*+i 

where 

K{t) = E{X(t)WY T (t)}(E{Y(t)WY T (t)})-\ 

Hence, we have that 
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X(t+1) = AX{t) + K{t)Y{t) 

= AX{t)+K(t)(Y(t)-Y(t)) 

= (A- K(t)C)X(t) + K(t)Y(t) (37) 
X(t) = (A- K(t)C)X(t) - K(t)W(t) + BW(t) 



and 



Then, the estimator (|371 ) can be written as N separate estimators with respect to the measurements 

Vi{t): 

Xtf + 1) = (A - K(t)C)Xi(t) + K(t)Yi(t). 

(38) 

Hence, the estimator X(t) can be implemented in a distributed manner. Finally, let Y = I n and 



r, 



••• ••• 



(identity matrix in block-position j) 



We obtain the optimal estimator x(t) from X(t) by using Proposition [3] with Lj = Tj. We 
conclude our result with the theorem below: 

Theorem 5: Consider the weighted distributed estimation problem (|29l ). Let X(Q) =0 and 

X(t + l) = (A- K(t)C)X(t) + K(t)Y(t) 

X(t + 1) = (A- K(t)C)X(t) - K(t)W(t) + BW(t) 

with 

K{t) = E{X(t)WY T (t)}(E{Y(t)WY T (t)})- 1 . 
Partition X in N blocks of n x n matrices [X],^. Then, the optimal estimator is given by 

N 
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VIII. Distributed State Feedback with Cross-Correlation in the Disturbance 

The distributed state feedback given by © considers the case where w(t) ~ A/"(0, 1), that is 
Wi(t) is uncorrelated with Wj(t) for i j. We will now consider a slightly different problem 
where w(t) ~ jV(0, W) for a general positive definite matrix W: 

1 a/ at 
inf lim — V V E||^(t)|| 2 

subject to ar(* + 1) = Ax(t) + Bu(t) + w(t) 
z (t) = Cx{t) + Du{t) 

B = dmg(B lu ...,B NN ) (39) 



(t) = J2k(s)x(t-s) 



u , 

s=0 

w (t) = x(t) = x(0) = for all t < 

w(t) ~ W(0, W) for all t > 
Following the same arguments as the proof of Theorem |3] we see that ([39] is equivalent to 
the feedforward control problem 

M N 

inf lim — VVE|k(i)|| 2 

G(A)e5 ^Xn M ^ M j^j^ 

subject to x(t + 1) = + Stt(t) + w(t) 

B = diag(Bn, ...,B NN ) 

z(t) = Cx(t) + Du(t) (40) 

oo 

u(t) = -J2g(s)w(t-i-s) 

s=0 

w(t) = x(t) = x(0) = for all t < 

w(t) ~ jV(0, W) for all t > 
where the only change is in that w(t) ~ A/"(0, W). It is also straightforward to apply the proof of 
Theorem @] to show that the dual of (l40l is given by the weighted distributed estimation problem 
(1291 . since the dynamics can be written as 

x(t + 1) = Ax(t) + Bu{t) + W*w(t) 
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where w(t) ~ J\f(0,I) is white noise. We have already seen that problem (|29l ) is conceptually 
more general for general weight matrices W. Therefore, correlation in the disturbance for 
distributed state feedback control changes the problem substantially. 

IX. Conclusion 

We showed that distributed estimation and control problems are dual under partially nested 
information pattern using a novel system theoretic formulation of dynamics over graphs. We 
showed that the distributed estimation problem can be decomposed into iV separate problems that 
are easy to solve, and hence solve the corresponding distributed control problem due to the duality 
that was shown in this paper. We considered a distributed estimation problem formulated as a 
dynamical team problem. We proposed a solution based on linear quadratic team decision theory, 
which provides a generalized Riccati equation for teams. We also showed that the weighted 
estimation problem is the dual to a distributed state feedback problem, where the disturbances 
entering the systems are correlated, and hence, a solution is obtained based on generalized Riccati 
equation for teams. 
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Appendix 

A. Graph Theory 

A (simple) graph Q is an ordered pair Q := (V, 8) where V is a set, whose elements are called 
vertices or nodes, 8 is a set of pairs (unordered) of distinct vertices, called edges or lines. The 
set V (and hence 8) is taken to be finite in this paper. A loop is an edge which starts and ends 
with the same node. 

A directed graph or digraph Q is a graph where 8 is a set of ordered pairs of vertices, called 
directed edges, arcs, or arrows. An edge e = (v i: Vj) is considered to be directed from Vi to vy, 
Vj is called the head and Vi is called the tail of the edge. 

The adjacency matrix of a finite directed graph Q on n vertices is the n x n matrix where the 
nondiagonal entry Aij is the number of edges from vertex j to vertex i, and the diagonal entry 
An is the number of loops at vertex i (the number of loops at every node is defined to be one, 
unless another number is given on the graph). 
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